Data Augmentation - Concepedia

About

Data augmentation is a technique in machine learning and data science for artificially increasing the size and diversity of a training dataset. It applies transformations or modifications to the original data instances, generating new synthetic examples while preserving the characteristics relevant to the learning task. Work on the topic covers techniques for generating valid variations of existing data to improve model generalization, robustness, and performance, particularly when little data is available, and to mitigate overfitting by exposing models to a wider range of potential inputs. It is a fundamental strategy for training complex models, especially deep neural networks, across domains such as computer vision, natural language processing, and audio processing.
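As a rough illustration of the idea described above, the following minimal sketch augments a toy image dataset using only the Python standard library. Everything in it is an assumption made for illustration: images are represented as 2D lists of grayscale values in [0, 1], and the helper names (hflip, shift_right, add_noise, augment, augment_dataset) and parameter values are hypothetical, not taken from any particular library. It also assumes that flips, small shifts, and mild noise leave the label unchanged, which holds for many but not all tasks.

```python
import random

def hflip(image):
    """Mirror a 2D grayscale image (list of rows) left to right."""
    return [row[::-1] for row in image]

def shift_right(image, pixels, fill=0.0):
    """Translate the image right by `pixels` (0 <= pixels <= width),
    padding the exposed left edge with `fill`."""
    return [[fill] * pixels + row[:len(row) - pixels] for row in image]

def add_noise(image, rng, scale=0.02):
    """Add Gaussian noise to every pixel and clamp the result to [0, 1]."""
    return [[min(1.0, max(0.0, v + rng.gauss(0.0, scale))) for v in row]
            for row in image]

def augment(image, rng):
    """Return one randomly transformed copy of `image`."""
    # Flip with probability 0.5; otherwise start from a plain copy.
    out = hflip(image) if rng.random() < 0.5 else [row[:] for row in image]
    # Shift by 0 or 1 pixel, then perturb pixel values slightly.
    out = shift_right(out, rng.randint(0, 1))
    return add_noise(out, rng)

def augment_dataset(images, labels, copies, seed=0):
    """Return the original data followed by `copies` augmented variants
    of each image. Each variant keeps the label of its source image."""
    rng = random.Random(seed)
    new_images, new_labels = list(images), list(labels)
    for image, label in zip(images, labels):
        for _ in range(copies):
            new_images.append(augment(image, rng))
            new_labels.append(label)
    return new_images, new_labels
```

Calling augment_dataset(images, labels, copies=3) on a dataset of N images returns 4N images and 4N labels. In practice the set of transformations has to be chosen per task: a horizontal flip is usually harmless for photographs of objects but can change the meaning of text or digits.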

Top Publications

Rankings shown are based on citation count.

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift	2015
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification	2015
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift	2024
A survey on Image Data Augmentation for Deep Learning	2019
YOLOv4: Optimal Speed and Accuracy of Object Detection	2020